Figure 1 



Nucleotide sequence of the Prunus amygdalus HNL5 gene 
obtained by PGR .amplification 

The start codon (ATG) and stop codon of the 
open reading frame are printed in bold type, and the 
nucleotides in the intron regions are indicated in 
lower case letters. The peripheral sequences which have 
been attached via the PCR primers and which are not 
part of the HNL5 gene are underlined. The splice sites 
of the introns were identified with the aid of the 
consensus sequence "GT. . . .AG" . 

1 GGAATTCAC A ATATGGAGAA ATCAACAATG TCAGTTATAC TATTTGTGTT 

51 GCATCTTCTT GTTCTTCATC TTCAGTATTC AGAGGTTCAC TCGCTTGCCA 

101 ATACTTCTGC TCATGgtaaa tttccatctt cagtattcat ttascagcaa 

1 51 aatgtgtaga tttataatta agaaaactga cacaagtagt gcaagaaaca 

201 agctaattta gatgcatgtt gaaaaaaatc tttcatctct tcacatatat 

251 tttgcagATT TTAGCTACTT GAAGTTTGTG TACAACGCCA CTGATACAAC3 

301 CTCGGAAGGA TCATATGACT ACATTGTAAT CGGTGGAGGA ACATCAGGGT 

351 GTCCATTGGC AGCAACTTTA TCAGAAAAAT ACAAGGTGCT TCTTCTAGAA 

401 AGAGGCACTA TTGCTACAGA ATACCCGAAC ACGTTGACTG CAGATGGGTT 

451 TGCATATAAT CTGGAGCAAC AAGATGATGG AAAGACGCCA GTTGAAAGGT 

501 TCGTGTCCGA AG ATG GC ATT GATAATGTGC GAGCCAGGAT CCTCGGTGGC 

551 ACGACCATAA TCAATGCAGG CGTCTACGCC AGAGCTAACA TTTCATTCTA 



601 



TAGTCAAACA GGAATTGAAT GGGACCTGGA TTTGGTCAAT AAGACATATG 
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AGTGGGTTGA AGACGCCATT GTGGTCAAGC CAAATAATCA ATCTTGGCAA 
TCTGTTATAG GAGAGGGATT CTTGGAGGCG GGTATTCTTC CAGACAATGG 
ATTTAGTTTG GATCACGAAG CAGGAACTAG ACTCACCGGC TCAACTTTTG 
ACAATAATGG AACGCGACAT GCGGCTGATG AACTGCTTAA TAAAGGAGAC 
CCTAATAACT TGCTAGTTGC AGTTCAGGCC TCAGTAGAGA AGATCCTCTT 
CTCTTCCAAT ACATCAAgta tgttgcatca gtgatattta atggtagctc 
ctagttlgtc atgctgcact cgaaaattat tattttatca ttttaaaata 
ctaacagaat agtgtgaagt ctcatatttc ccttccatat ttcccaaatt 
tccalaaaca aaacttccca attctccttc gtttagtttg acaataatta 
taagctattc tctaatgcag ATTTGTCAGC TATTGGAGTC ATATATACGG 
ATTCTGATGG AAACTCTCAT CAGGCATTTG TACGCGGTAA CGGAGAAGTT 
ATTGTTAGTG CAGGGACAAT CGGAACGCCT CAGCTTCTAC TACTTAGTGG 
CGTTGGACCA GAGTCTTACC TATCTTCTCT CAACATCACA GTTGTTCAGC 
CGAATCCTTA TGTTGGGCAG TTTGTGTATG ACAATCCTCG TAATTTCATT 
AATATTTTGC CCCCAAATCC AATTGAAGCC TCTGTTGTAA CTGTTTTAGG 
CATTAGAAGT GATTATTATC AAGTTTCTCT GTCAAGCTTG CCATTTTCCA 
CTCCACCCTT TAGTCTT7TT CCTACAACAT CTTACCCCCT CCCAAATTCG 
ACTTTTGCTC ATATTGTTAG CCAAGTTCCA GGACCATTGT CTCATGGTTC 
TGTCACGCTA AATTCATCAT CTGACGTGAG AATCGCTCCA AATATTAAAT 
TCAATTACTA TTCAAATTCC ACAGACCTTG CTAATTGTGT TAGCGG C ATG 
AAGAAGCTTG GTGACTTATT AAGGACAAAG GCATTAGAAC CATATAAAGC 
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1 701 TCG AG ATGTG CTGGGAATTG ACGGTTTCAA TTATTTGGG A GTACCTTTGC 

1 751 CAGAGAACCA AACAGATGAT GCATCCTTCG AAACATTTTG TCTAGATAAT 

1801 GTAGCTTCAT ACTGGCATTA CCACGGTGGA AGCCTTGTTG GGAAAGTGCT 

1 851 TGATG ACAGT TTCCGTGTTA TGGGGATCAA AGCATTACGC GTTGTTGATG 

1 901 CCTCCACTTT CCCTTACG AA CCAAACAGCC ATCCTCAGGG CTTCTATCTG 

1 951 ATGTTAGGAA Ggtatgtgat gcacacttcc aaccactaga gatlctcaat 

Q 2001 attttgttgt tgttgtaatg aactctctgc cgcattgctc ttttttatta 

a 



2051 atccttaaaa ttttgtgttt tgcgcagGTA TGTGGGCCTT CAAATCCTGC 



yj 21 01 AAGAAAGGTC AATCCGGTTG GAGGCTATTC ATAATATTCA AGAGTCCATG 

ru 

L 2151 TGAAGAATTCCG 
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Figure 3 : 



Amino acid sequence of the Prunus amygdalus 

hydroxynitrile lyase (HNL5)-, — derived --from the 

nucleotide sequence of the HNL5 gene . The signal 
sequence determined from sequence analysis is printed 
in bold type and the postulated processing site is 
indicated by an arrow. Possible glycosylation sites 
(PROSITE patterns) are underlined. 

I 

MEKSTMSVILFVLHLLVLHLQYSEVHSLANTSAH 

VIGGGTSGCPLAATLSEKYKVLLLERGTiATEYPNTLTADGFAYNLQQQDDGKTPVE 
RFVSEDGIDNVRARILGGTTIINAGVYAR^^ 

WKPNNQSWQSViGEGFLEAGILPDNGFSLDHEAGTRLTGSTFDN NGTR HAADELL 

NKGDPNNLLVAVQASVEKILFSSNTSNLSAIGVIYTDSDGNSHQAFVRGNGEVIVSA 

GTIGTPQLLLLSGVGPESYLSSLNTTVVQPNPYVGQFVYDNPRNFINILPPNPIEASVV 

TVLGiRSDYYQVSLSSLPFSTPPFSLFPTTSYPLP NSTF AHIVSQVPGPLSHGSVTLN 

S^DVRIAPNIKFNYYSNSTDLANCVSGMKKLGDLLRTKALEPYKARDVLGIDGFNY 

LGVPLPENQTDDASFETFCLDNVASYWHYHGGSLVGKVLDDSFRVMGIKALRWD 

ASTFPYEPNSHPQGFYLMLGRYVGLQJLQERSIRLEAIHNIQESM 



Figure 4 : 



Nucleic acid sequence of the DNA fragment coding for a 
secretory hybrid protein (PamHNL5xGOX) with HNL 
activity, consisting of sequences of the Prunus 
amygdalus HNL5 gene and the Aspergillus niger glucose 
oxidase gene . 

gaattcatcatgcagactctccttgtgagctcgcttgtggtctccctcgctgcggccctgcca- 

cactacatcaggagcaatggcattgaagcctacaacgccactgatacaagctcggaaggatca-* 

tatgactacattgtaatcggtggaggaacatcagggtgtccattggcagcaactttatcagaa- 

aaatacaaggtgcttcttctagaaagaggcactattgctacagaatacccgaacacgtt- 

gactgcagatgggtttgcatataatctgcagcaacaagatgatggaaagacgccagttga- 

aaggttcgtgtccgaagatggcattgataatgtgcgagccaggatcctcggtggcacgacca- 

taatcaatgcaggcgtctacgccagagctaacatttcattctatagtcaaacaggaatt- 

gaatgggacctggatttggtcaataagacatatgagtgggttgaagacgccattgtggt- 

caagccaaataatcaatcttggcaatctgttataggagagggattcttggaggcggg- 

tattcttccagacaatggatttagttfggatcacgaagcaggaactagactcaccggct- 

caacttttgacaataatggaacgcgacatgcggctgatgaactgct(aataaaggagaccc- 

taataacttgctagttgcagttcaggcctcagtagagaagatcctcttctcttccaatacat- 

caaatttgtcagctattggagtcatatatacggattctgatggaaactctcatcaggcatttg- 

tacgcggtaacggagaagttattgttagtgcagggacaatcggaacgcctcagcttctac- 

tacttagtggcgtiggaccagagicttacctatcttctctcaacatcacagttgttcagcc- 

gaatccttatgttgggcagtttgtgtatgacaatcctcgtaatttcattaatattttgcccc- 

caaatccaattgaagcctctgttgtaactgttttaggcattagaagtgattattat- 

caagtttctctgtcaagcttgccattttccactccaccctttagtctttttcctacaacatct- 

taccccctcccaaattcgacttttgctcatattgttagccaagttccaggaccattgtct- 

catggttctgtcacgctaaattcatcatctgacgtgagaatcgctccaaatattaaattcaat- 

tactattcaaattccacagaccttgctaattgtgttagcggcatgaagaagcttggtgact- 

tattaaggacaaaggcattagaaccatataaagctcgagatgtgctgggaattgacggttt- 

caattatttgggagtacctttgccagagaaccaaacagatgatgcatccttcgaaa- 

cattttgtctagataatgtagcttcatactggcattaccacggtggaagccttgttggga- 

aagtgcttgatgacagtttccgtgttatggggatcaaagcattacgcgttgttgatgcctc- 

cactttcccttacgaaccaqacagccatcctcagggcttctatctgatgttaggaagg- ... 

tatgtgggccttcaaatcctgcaagaaaggtcaatgcagtgagcggccgcatgcgaattc 



Figure 5: Amino acid sequence of the hybrid protein 
PamHNLSxGOX, derived fromi the nucleic acid sequence 
(figure 4) . 

MQTLLVSSLWSLAAALPHYIRSNGIEAYNATDTSSEGSYDYIVIGGGTSGCPLAATL 

SEKYKVLLLERGTIATEYPNTLTADGFAYNLQQQDDGKTPVERFVSEDGIDNVRARI 

LGGTTIINAGWARANISFYSQTG1EWDLDLVNKTYEWVEDAIWKPNNQSWQSVIG 

EGFLEAGILPDNGFSLDHEAGTRLTGSTFDNNGTRHAADELLNKGDPNNLLVAVQA 

SVEKILFSSNTSNLSAIGVIYTDSDGNSHQAFVRGNGEVIVSAGTIGTPQLLLLSGVG 

PESYLSSLNITWQPNPWGQFWDNPRNFINILPPNPIEASVVTVLGIRSDYYQVSLS 

SLPFSTPPFSLFPTTSYPLPNSTFAHIVSQVPGPLSHGSVTLNSSSDVRIAPNIKFNY 

YSNSTDLANCVSGMKKLGDLLRTKALEPYKARDVLGIDGFNYLGVPLPENQTDDAS 

FETFCLDNVASYWHYHGGSLVGKVLDDSFRVMGIKALRWDASTFPYEPNSHPQG 

FYLMLGRYVGLQILQERSMO 



Figure 6 : Comparison of the amino acid sequences of 
Prunus amygdalus HNL5 and of the hybrid protein 
PamHNL5xGOX . Sequence, parts of Aspergillus niger 
glucose oxidase are underlined. Sequence regions having 
no significant homology between the two proteins are 
printed in italics, and the signal peptides are printed 
in bold type. 

PamHNLSGox 1 matllvsslv vslaaalo h v Irsng iea YNATOTSS 

PamHNLS 1 mekstmsvil Mhtlvtht qysevhstan tsahdfsylk /VYNATDTSS 

PamHNLSGox 37 EGSYDYIVIG GGTSGCPLAA TLSEKYKVLL LERGTIATEY PNTLTADGFA 
PamHNLS 51 EGSYDYIVIG GGTSGCPLAA TLSEKYKVLL LERGTIATEY PNTLTADGFA 

PamHNLSGox 87 YNLQQQDDGK TPVERFVSED GIDNVRARIL GGTTUNAGV YARANISFYS 
PamHNL5 101 YNLQQQDDGK TPVERFVSED GIDNVRARIL GGTTUNAGV YARANISFYS 

PamHNL5Gox 137 QTGIEWDLDL VNKTYEWVED AIWKPNNQS WQSVIGEGFL EAGILPDNGF 
PamHNL5 1 51 QTGIEWDLDL VNKTYEWVED AIWKPNNQS WQSVIGEGFL EAGILPDNGF 

PamHNLSGox 1 87 SLDHEAGTRL TGSTFDNNGT RHAADELLNK GDPNNLLVAV QASVEKILFS 
PamHNLS 201 SLDHEAGTRL TGSTFDNNGT RHAADELLNK GDPNNLLVAV QASVEKILFS 

PamHNL5Gox 237 SNTSNLSAIG VJYTDSDGNS HQAFVRGNGE VIVSAGTIGT PQLLLLSGVG 
PamHNLS 251 SNTSNLSAIG VIYTDSDGNS HQAFVRGNGE VIVSAGTIGT PQLLLLSGVG 

PamHNLSGox 287 PESYLSSLNI TVVQPNPYVG QFVYDNPRNF INILPPNPIE ASWTVLGIR 
PamHNLS 301 PESYLSSLNI TVVQPNPYVG QFVYDNPRNF INILPPNPIE ASWTVLGIR 

PamHNL5Gox 337 SDYYQVSLSS LPFSTPPFSL FPTTSYPLPN STFAHIVSQV PGPLSHGSVT 
PamHNLS 351 SDYYQVSLSS LPFSTPPFSL FPTTSYPLPN STFAHIVSQV PGPLSHGSVT 

PamHNLSGox 387 LNSSSDVRIA PNIKFNYYSN STDLANCVSG MKKLGDLLRT KALEPYKARD 
PamHNL5 401 LNSSSDVRIA PNIKFNYYSN STDLANCVSG MKKLGDLLRT KALEPYKARD 

PamHNLSGox 437 VLGIDGFNYL GVPLPENQTD DASFETFCLD NVASYWHYHG GSLVGKVLDD 
PamHNLS 451 VLGIDGFNYL GVPLPENQTD DASFETFCLD NVASYWHYHG GSLVGKVLDD 

PamHNL5Gox 487 SFRVMGIKAL RWDASTFPY EPNSHPQGFY LMLGRYVGLQ ILQERSmg- 
PamHNL5 501 SFRVMGIKAL RVVDASTFPY EPNSHPQGFY LMLGRWGLQ ILQERS/r/e 

PamHNL5Gox 535 

PamHNL5 551 aihniqesm 



V 
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Figure 8 : 

Nucleotide sequence of the Prunus amygdalus HNL1 gene 
obtained by PCR amplification. 

ATGGAGAAATCAACAATGTCAGCTATACTGTTGGTGTTATACATTTTTGTCCTCC 
ATCTTCAATATTCTGAGGTCCACT 

CGCTTGCCACGACTTCTGATCATGgtaaatcacttcaaccgtaattcaaaacaccaaaaagg- 
caatcaaaaagaaaacg 

gaaaaaagtgtaagaaaagcagatatagacgcctgcatagatgcatgtgctatatacttttaaaaactcttcgtctctt 

gagattttgcagATTTTAGCTACCTGAGCTTTGCATACGACGCCACTGATCTA- 

GAGTTGGAAGGATCATATGACTACGT 

TATAGTTGGCGGAGGAACATCAGGGTGTCCATTGGCAGCAACTTTATCAGAAAA 
ATACAAGGTGCTCGTTCTCGAAAGG 

GGCAGTCTTCCGACAGCATATCCGAACGTCTTGACTGCAGATGGGTTTGTATAT 
AATCTCCAGCAAGAAGATGATGGAA 

AGACACCGGTCGAAAGGTTCGTGTCCGAAGATGGTATTGATAATGTACGGGGC 
AGGGTGCTCGGTGGCACAAGCATTAT 

CAATGCCGGTGTCTACGCCAGAGCTAACACCTCAATCTATAGTGCATCAGGAGT 
TGATTGGGACATGGATTTGGTTAAT 

CAGACATATGAGTGGGTTGAAGACACTATTGTGTACAAGCCAAATTCTCAATCTT 
GGCAGTCTGTTACAAAAACTGCAT 

TCTTGGAGGCTGGTGTTCATCCAAACCATGGATTTAGTTTAGATCATGAAGAAG 
GAACTAGAATTACCGGCTCAACTTT 

TGACAACAAGGGAACGAGACATGCAGCTGATGAACTTCTTAATAAAGGAAACTC 
TAACAACTTGCGAGTTGGAGTTCAT 

GCCTCAGTAGAGAAGATCATCTTCTCCAATGCACCAGgtatgttgcatcatgcactccaa- 
aattaatattttgtcattt 

taaaacactagcaggagccaaggtctggaagtacgaataaaatttcattattttccttggatttgtttgataatgatta 
taagcttttctgtaatgtagGTTTGACAGCTACAGGAGTCATATATAGGGATTCTAATG- 
G AAC G C CTC AC CAAG CATTT 

GTACGCAGTAAGGGAGAAGTTATCGTGAGTGCAGGGACAATTGGGACCCCTCA 
ACTTCTACTACTTAGCGGTGTTGGGC 



CAGAGTCTTACCTATCATCTCTAAATATTCCAGTTGTTCTTTCCCATCCTTACGTC 
GGACAGTTTCTGCATGACAATCC 

TCGTAATTTCATTAACATTTTGCCCCCAAATCCAATTGAACCCACAATTGTAACTG 
YTCTAGGCATTTCAAACGATTTC 

TACCAATGTTCTTTCTCGAGCTTGCCATTTACAACTCCACCCTTCGGTTTT^ 
CTAGTGCATCTTATCCCCTGCCAA 

ATTCGACTTTTGCTCACTTTGCTAGCAAAGTGGCAGGACCTTTATCATATGGTTC 
TCTCACACTGAAATCATCCTCCAA 

TGTGAGAGTCAGTCCAAATGTCAAATTTAATTACTATTCAAATCTGACAGATCTTT 
CTCATTGTGTTAGCGGCATGAAG 

AAGATTGGTGAACTCTTGAGCACAGACGCATTAAAACCATATAAAGTTGAAGATT 
TGCCGGGTGTAGAAGGTTTTAATA 

TTTTGGGAATCCCTTTGCCAAAGGACCAAACAGATGATGCAGCCTTCGAAACAT 
TTTG C C G AG AATC AGT AG C CTC ATA 

TTGGCACTACCACGGTGGATGCCTTGTTGGAAAGGTGCTTGATGGTGATTTCCG 
TGTTACAGGGATCAATGCATTACGC 

GTTGTTGATGGCTCAACATTCCCTTACACACCAGCGAGCCACCCTCAGGGCTTC 
TATCTGATGTTAGGGAGgtatgtta 

caaattctcastaattatttggttgagtggcitgtttcfaatgaactctatgccatatttctctttctcatcctttcca 
tttttgtgccatgggcagGTATGTGGGCATTAAAATTCTGCAAGAAAGATCAGCTTCA- 
GATCTAAAAATCTTGGATTCC 
CTCAAGTCAGCAGCATCCTTGGTTCTTTAAACT 



Figure 9 : 



Amino acid sequence of Prunus amygdalus hydroxynitrile 
lyase (HNL1) , derived from the nucleotide sequence of 
the HNL1 gene . 



MEKSTMSAILL VL YIFVJ.HLQ YSE VHSLA TTSDHDFSYLSFA YDA TDLELEGSY 
DYVI VGGGTSGCPLAA TLSEKYKVL VLERGSLPTA YPN VI. TADGFVYNLOQE 
DDGKTPVERFVSEDG1DNVRGRVLGGTSUNAGVYARANTSIYSASGVDWDM 
DL VNOTYEWVEDTIVYKPNSOSWQSVTKTAFLEAGVHPNHGFSLDHEEGTRI 
TGSTFDNKGTRHAADELLNKGNSNNLRVGVHASVEKIIFSNAPGLTATGVJYR 
DSNGTPHQAFVRSKGE VIVSAGT1GTPOLLLLSG VGPESYLSSLN1P VVLSHPY 
VGQFLHDNPRNFINILPPNPlEPTIVrVLGJSNDFYQCSFSSLPFTTPPFGFFPS 
ASYPLPNSTFAHFASKVAGPLSYGSLTLKSSSNVRVSPNVKFNYYSNLTDLSHC 
VSGMKKIGELLSTDALKPYK VEDLPG VEGFN1LGIPLPKDQTDDAAFETFCR 
ESVASYWHYHGGCLVGKVLDGDFRVTG1NALRWDGSTFPYTPASHPQGFYL 
MLGRYVGIKILOERSASDLKILDSLKSAA SL VL 



